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Abstract. We consider distance queries in vertex labeled planar graphs. 
For any fixed 0 < e < 1/2 we show how to preprocess an undirected pla¬ 
nar graph with vertex labels and edge lengths to answer queries of the fol¬ 
lowing form. Given a vertex u and a label A return a (l-l-e)-approximation 
of the distance between u and its closest vertex with label A. The prepro¬ 
cessing time is 0{e~^n\g^ n), the required space is 0(e“^nlgn), and the 
query time is 0(lg Ig n-|-e~^). For a directed planar graph with arc lengths 
bounded by N, the preprocessing time is 0(e“^n Ig® n Ig(nA’)), the space 
is 0(e“^n Ig n lg{nN)), and the query time is 0(lg Ig n Ig Ig (nN) + e~^). 


1 Introduction 

Imagine you are driving your car and suddenly see you are about to run out of 
gas. What should you do? Obviously, you should find the closest gas station. This 
is the vertex-to-label distance query problem. Various software applications like 
Waze and Google maps attempt to provide such a functionality. The idea is to 
preprocess the locations of service providers, such as gas stations, hospitals, pubs 
and metro stations in advance, so that when a user, whose location is not known 
a priori, asks for the distance to the closest service provider, the information can 
be retrieved as quickly as possible. 

We study this problem from a theoretical point of view. We model the net¬ 
work as a planar graph with labeled vertices (e.g., a vertex labeled as a gas 
station). We study distance oracles for such graphs. A vertex-label distance ora¬ 
cle is a data structure that represents the input graph and can be queried for the 
distance between any vertex and the closest vertex with a desired label. We con¬ 
sider approximate distance oracles, which, for any given fixed parameter e > 0, 
return a distance estimate that is at least the true distance queried, and at most 
(1 -I- e) times the true distance (this is known as a (1 -I- e)-stretch). One would 
like an oracle with the following properties; queries should be answered quickly, 
the oracle should consume little space, and the construction of the oracle should 
take as little time as possible. We use the notation {0{S{n))space ? 0{T{n))time) 
to express the space requirement and query time of a distance oracle. 

The vertex-to-label distance query problem was introduced by Hermelin, 
Levy, Weimann and Yuster [6]. For any integer k > 2, they gave a (4fc — 
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5)-stretch space i 0{k)time) vertex-label distance oracle (expected 

space) for undirected general (i.e., non-planar) graphs. This is not efficient when 
the number I of distinct labels is They also presented a (2^ — l)-stretch 

{0{knl^/^)space ) 0{k)time) Undirected oracle, and showed how to maintain la¬ 
bel changes in sub-linear time. Chechik [4] improved the latter two results to 
(4/c — 5)-stretch and similar space/time bounds. 

For planar graphs, the only vertex-label distance oracle we are aware of was 
described by Li, Ma and Ning [10]. They construct a (1 -|- e)-stretch oracle with 
{0{e~^n\gn) space j 0{e~^ \gn\g p)time) bounds for undirected graphs. Here, p 
is the radius of the graph, which can be 0{n). It is also shown in [10] how to 
avoid the Igp factor when p = O(lgn). The construction time of their oracle is 
0(e“^n log^ n). 

Our results and approach We give a (1 -I- e)-stretch (0(e“^n Ig n)space i 
0(lglgn -|- t~^)time) vertex-label distance oracle for undirected planar graphs 
that can be constructed in 0(e“^nlg^n) time. This improves over the query 
time of [10] by roughly a log^ n factor. For directed planar graphs we give a 
(l-ke)-stretch {0{€~^n\gn\g{nN))space , 0{\g\gn\g\g {nN) + e~^)tim.e) vertex- 
label distance oracle whose construction time is 0(e“^n Ig^ n Ig(niV)). To the 
best of our knowledge, no non-trivial directed vertex-label distance oracles were 
proposed prior to the current work. 

Consider a vertex-to-vertex distance oracle for a graph with label set L. If 
the oracle works for general directed graphs then the vertex-to-label problem 
can be solved easily; add a distinct apex v\ for each label X G L, and connect 
every A-labeled vertex to v\ with a zero length arc. Finding the distance from a 
vertex u to label A is now equivalent to finding the distance between u and vx. 
This approach presents two main difficulties when designing efficient oracles for 
planar graphs. First, adding apices breaks planarity. In particular, it affects the 
separability of the graph. Thus, the reduction does not work with oracles that 
depend on planarity or on the existence of separators, which are more efficient 
than oracles for general graphs. Second, the reduction uses directed arcs, so it 
is unsuitable for oracles for undirected graphs. Using arcs in the reduction is 
crucial since connecting an apex with undirected zero length edges changes the 
distances in the graph. This is because the apex can be used to teleport between 
vertices with the same label. ^ 

We nonetheless use this approach, and show how to overcome these obsta¬ 
cles. We augment a directed and an undirected variants of a distance oracle of 
Thorup [12] for planar graphs. These oracles rely on the existence of fundamen¬ 
tal cycle separators in planar graphs, a property that breaks when apices are 
added to the graph. However, we observe that once the graph is separated, Tho- 
rup’s oracle does not depend on planarity. We therefore postpone the addition 
of the apices till a later stage in the construction of the distance oracle, when 

^ Teleporting between vertices might be desirable in some applications. For example, 
calculating the walking distance between two locations without accounting traveling 
between train stations. 
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the graph has already been separated. We show that, nonetheless, approximate 
distances from any vertex to any label in the entire graph can be approximated. 
Moreover, we observe that Thorup’s undirected oracle internally uses the same 
directed structures as in his directed oracle. It only depends on the undirect¬ 
edness in making the number and sizes of these structures smaller than in the 
directed case. We extend this argument to handle vertex labels. 

Additional Related Work We summarize related work on approximate vertex- 
vertex distance oracles. For general graphs, no efficient (2)-stretch approximate 
vertex-vertex distance oracles are known to date. Thorup and Zwick [13] pre¬ 
sented a (2fc— l)-stretch space i 0{k)time) undirected distance oracle 

which is constructed in 0{km'n}/^) time. Wulff-Nilsen [14] achieved the same 

result with preprocessing of 0(kn for universal constant c. Several more 
improvements of [13] have been found for unweighted or sparse graphs ([1], [2], 

[3]). 

In contrast, vertex-vertex oracles for planar graphs with stretch less then 2 
have been constructed. Thorup [12] gave a {0{e~^n\gn\g{nN))space i 0(lglg {nN)+ 
i~^)time) stretch (1 -|- e) directed distance oracle, and a {0{e~^nlgn)space ) 
0(e~^)time) undirected (simplified) distance oracle. Our result is base on Tho¬ 
rup’s oracles, which are described in Section 3. Klein [9] independently gave an 
undirected distance oracle with same bounds. Kawarabayashi, Klein and Som¬ 
mer [7] have shown a {0{n)space > 0(e“^ lg~^(?r))time) undirected (1 -|-e)-stretch 
distance oracle constructed in 0(n Ig^ n) time, inspired by [12]. [7] give a trade-off 
of {0{-—space ) 0(r -I- y/re~^\gn)time) Oracle algorithms. Kawarabayashi 
et al. [8] have shown better tradeoffs for undirected oracles. For the case where 
N G poly{n), they achieve {O* {n\gn)space , 0*(e“^)tzme) oracle, where O* hides 
lg(e“^) and lg*(n) factors. 

Roadmap In this extended abstract we focus on the undirected case. The 
remainder of this paper is organized as follows. We describe the scheme of the 
vertex-to-vertex distance oracle of Thorup in Section 3. In Section 4, we describe 
a vertex-labeled oracle for undirected planar graphs. Our description goes into 
some of details that cannot be cited and used from [12] due to a minor flaw in the 
treatment of the undirected case in [12]. In Appendix B we elaborate on the flaw 
in [12] and explain how to it is corrected. Due to space constraints, our vertex- 
labeled distance oracle for directed planar graphs is described in Appendix C. 

Its construction is similar to the undirected oracle, but relies on some additional 
reductions from [12] that we use without change. 

2 Preliminaries 

Let V (G) denote the vertex set of a graph G. We use the terms arcs and edges 
to distinguish directed and undirected graphs. Let A{G) {E{G)) denote the arc 
(edge) set of a directed (undirected) graph. We denote the concatenation of two 
paths Pi and P 2 that share an endpoint by Pi o P 2 . 
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For a simple path Q and a vertex set U C V{Q), we define Q, the reduction 
of Q to {7 as follows. Repeatedly apply the following procedure to Q. Let wv be 
an edge of Q s.t. v ^ U. Contract wv, and add the length of wv to the length of 
the other edge of Q incident to w, if such one exists. Note that |R(Q)| = 0{\U\). 

Let T be a rooted spanning tree of a graph G. For u G V{G), let T[u] denote 
the unique root-to-M path in T. The fundamental cycle of e = (ui,U 2 ) ^ E{T) 
is the undirected cycle composed of E[T[ui]), E{T[u 2 \), and e. 

Let L = be a set of I labels. A vertex-labeled graph is a graph 

G = {V, A), equipped with a function f :V ^ L. Let V\ = {v gV{G)\ f{v) = A} 
to be set of vertices with label A. 

Let G be a graph with arc lengths. For u,v G V{G), let 6c{u,v) denote the 

u-to-v distance in G. For a vertex-labeled G, we define Sg{u, A) = min 6g{u, w). 

w£Vx 

We assume basic familiarity with planar graphs. In particular, it is well known 
that if G is planar then |A(G)| = 0(|y(G)|), and that a simple cycle separates 
a planar graph G into an interior and an exterior parts. 

A vertex-label distance oracle is a data structure that, given a vertex v G V 
and a label X G L, outputs an (approximation) of Sg(v,X). We note that this 
problem is a generalization of the basic distance oracle problem in which each 
vertex is given a unique label. Constructing an 0(nZ)-space vertex-label distance 
oracle is trivial. Simply precompute and store the distance between each vertex 
and each possible label. The goal is, therefore, to devise an oracle which requires 
substantially less than nl space, while allowing for fast queries. 

3 Thorup’s Approximate Distance Oracle 

In this section we outline the distance oracle of Thorup [12]. This is necessary for 
understanding our results. The oracle we describe differs from the original in [12] 
in some of the details. See Appendix B for an explanation of the differences. 


3.1 e-covering sets 

The main idea is to store just a subset of the pairwise distances in the graph, 
from which all distances can be approximately computed efficiently. Given an 
undirected graph H, and a shortest path Q G H , Thorup shows that for every 
vertex v G H, there exists a set of 0{e~^) vertices on Q, called connections, such 
that the distances (called connection lengths) between every vertex of H and its 
connections on Q can be used to approximate, in 0(e“^) time, the length of any 
shortest path in El that intersects Q. Thorup essentially proves the following: ^ 

Lemma 1. Let Q he a shortest path in an undirected graph El. There exist sets 
C{u,Q) of 0{e~^) vertices of Q for all u G H, where: 

E G(u, Q) are called the connections of it on Q. 

^ While Lemma 1 true, we believe there is a flaw in the arguments in [12], see Ap¬ 
pendix B. 
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2. The distance between u and a connection q G C{u, Q) is called the connection 
length of u and q. 

3. For every u,w G H, if a shortest u-to-w path in H intersects Q, then 

5h^'-{u,w) < (1 + e)5H{u,w). 

Here -ffg™ is the graph with vertices u, w, and the vertices of the reduction 
of Q to C{u^Q)UC(w^Q), and with u-to-Q and Q-to-w edges whose lengths 
are the corresponding connection lengths of C{u,Q) and C{w,Q). 

Note that, since for every v \C{v,Q)\ = computing 6h^^{u,w) can be 

done in time that only depends on (in fact in 0{e~^) time). 

In the remainder of this subsection we prove Lemma 1. 

For efficiency reasons, instead of storing exact connection lengths the 

algorithm computes approximate connection lengths, which we denote by £{■■,■)■ 
This following definition captures the intuitive idea that if a v-to-q path that 
goes through q* is not too much longer than the shortest v-to-q path, then it 
suffices to store the distance from v to q* and the distance from q* to q. 

Definition 1. q* e-covers 9 w.r.t. v if £{v,q*) + 5{q*,q) < (1-b e)(5(u, g). 
Thorup [12] uses a different notion of covering.^ 

Definition 2. q* quasi e-covers <7 w.r.t. v ifi{v,q*)-\-S{q*,q) < 6{v,q)-\-el{v,q*). 

Let Q be a path. A set C of vertices of Q is a (quasi)-e-covering of Q w.r.t. 
V if for every q G Q there is a connection q* G C that (quasi)-e-covers q w.r.t. v. 

A covering set is called clean if it is inclusion-wise minimal and ordered if it 
is sorted by the order of connections along the path Q. Observe that by keeping 
the distance of every q G Q from the first vertex of Q, allows computing SQ{q, q') 
for any q,q' G Q in constant time. 

The notions of e-covering sets and quasi-e-covering sets are related by the 
following proposition: 

Proposition 1. Let C{v,Q) be a quasi e-covering set. For any 0 < e < 1/2, 
C{v,Q) is a 2e-covering set. 

Proof. Itq* quasi e-covers <7 then ri) < j^5{q,v) < 26{q,v). Hence ( 5 (( 7 , g*)-!- 
< d{q, v) -b ei{q*, u) < (1 -b 2e)5{q, v). Therefore, if C{v, Q) is a quasi e- 
covering set, it is a 2 e-covering set. □ 

The following lemma shows that, in order to prove Lemma 1, it is suffices 
that the sets C{v, Q) be e-covering sets of size 0{e~^). 

Lemma 2. ( [9, Lemma 4-lf^) u,w be vertices in an undirected graph H. 
Let Q be a shortest path in H such that a u-to-w shortest path intersects Q. Let 
C{u,Q),C{w^Q) be e-covering sets of Q w.r.t. u,w, respectively. Let H'^ be as 
in the statement of Lemma 1. Then, 

5h^'-{u,w) < {l-\-e)5H{u,w) ( 1 ) 

^ The term quasi-e-cover is not used by Thorup. He uses e-covers for this notion. 

^ Klein showed this lemma for e-covering sets, while Thorup showed a similar lemma 
using a different notion of e-covering sets. 
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Thorup shows how to efficiently construct quasi-e-covering sets. Let Q be a 
shortest path in an undirected graph H. Let sssp{Q, H) be the smallest num¬ 
ber s.t. for any subgraph Hq of H, and any vertex q S Qq, where Qq is the 
reduction of Q to Hq, we can compute single source shortest paths from q 
in the graph Qq U Hq in 0{sssp{Q, H)\E{Ho)\) time. It is easy to see that a 
standard implementation of Dijkstra’s algorithm with priority queues implies 
sssp{Q,H) = 0{lg\E{H)\). If E{ is planar, then sssp{Q,E{) = 0(1) by [5]. 

Lemma 3. ([12, Lemma 3.18]) Given an undirected graph E[ and shortest path 
Q, quasi e-covering sets of Q with respect to all vertices of H, each of size 
0{e~^ \gn), can be constructed in 0{e~^sssp{Q, H)\E{E[)\\g{\V{Q)\)) time. 

By Proposition 1 the quasi e-covers produced by Lemma 3 are 2e-covering 
sets. However, their sizes are too large. The sizes can be decreased using the 
following thinning procedure. The proof appears in Appendix A.® 

Lemma 4. Let Q be a path in an undirected graph, and let v be a vertex. Let 
D(v,Q) be an ordered CQ-cover of Q w.r.t. v. For any ei < 1, a clean and 
ordered (2eo -|- ei)-cot;er C{v, Q) C D{v, Q) of size can be constructed in 

0{\D{v,Q)\) time. 

Thus, by combining Lemma 3, Proposition 1, and Lemma 4, we get the fol¬ 
lowing corollary, which, along with Lemma 2, establishes Lemma 1. 

Corollary 1. Given an undirected graph H and a shortest path Q, e-covering 
sets of Q with respect to all vertices of H, each of size 0(e“^), can be constructed 
in 0{e~^sssp{Q,H)\E{H)\ lg(|P((3)|)) time. 

3.2 The distance oracle 

The construction is recursive, using shortest path separators. 

Lemma 5. (Fundamental Gycle Separator [11]) Let FI be an undirected pla¬ 
nar graph with a rooted spanning tree T and function w assigning non-negative 
weights to edges. One can find an edge e such that neither the weight strictly 
enclosed by the fundamental cycle of e nor the weight not enclosed by the fun¬ 
damental cycle of e exceeds | the weight of H. 

A planar graph G can be decomposed by computing a shortest path tree for 
an arbitrary vertex, and applying Lemma 5 recursively. Choosing the spanning 
tree in Lemma 5 to be a shortest path tree guarantees that each fundamental 
cycle separator found consists of two shortest paths. The decomposition can be 
represented by a binary tree T in the following manner. ® See Figure 1 in the 
appendix for an illustration. 

® In [12] a thinning procedure is given only for the directed case, and it is claimed that 
a quasi-e-covering set can be thinned. We believe this is not correct. See Appendix B. 
Instead, we give here a thinning procedure for e-covering sets (not quasi-e-covering). 
® We refer to the vertices of T as nodes to distinguish them from the vertices of the 
graph G. 
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— Each node r of T is associated with a subgraph Gr of G. The subgraph 
associated with the root of T is all of G. 

— Each non-leaf node r of T is associated with the fundamental cycle separator 
Sr found by invoking Lemma 5 on Gr- 

— Each non-leaf node r has two children, whose associated subgraphs are the 
interior and exterior of Sr- The vertices and edges of the separator belong 
to both subgraphs. 

Let r be a node of T. The frame Fr of Gr is the set of (shortest) paths in 
IdGr), where the union is over strict ancestors r' of r in T. Each non-leaf 
node r stores its frame Fr- A standard argument shows that, by alternating the 
separation criteria between number of edges in the graph and number of paths 
in the frame, one can get frames consisting of a constant number of paths. 

For r G T, let G° denote the subgraph of Gr\ Fr- That is, G° is the graph 
obtained from Gr by removing the edges of the frame Fr as well as any vertices of 
Fr that become isolated as a result of the removal. The sizes of the G°’s decrease 
by a constant factor along T, while the sizes of the G^-’s need not because there 
is no bound on the size of the fundamental cycle in Lemma 5. This may pose 
a problem, since the frame Fr is stored by every node r. To overcome this, the 
algorithm stores the reduction of Fr to G° instead of Fr itself. 

Let u,w he vertices of G. Let Vu^rw be the leaves of T such that u G Gr^ 
and w G Gr„ ■ Let r be the LCA of and ry in F- Observe that u and w are 
separated by Sr- Hence, every u-to-w path in G must intersect Sr- However, a 
u-to-w path may or may not intersect Fy- See Figure 2 in the appendix. Suppose 
first that a shortest u-to-w path P (in G) does intersect Fr- We write P = PqoPi. 
Path Pq is a maximal prefix of P whose vertices belong to G°. We call this kind 
of paths type-0 paths. Note that type-0 paths start at a vertex of G°, end at a 
vertex of Fy and are confined to G°. Path Pi consists of the remainder of P, 
and is referred to as a type-1 path. Note that type-1 paths start at a vertex of 
Fr D Gy, end at a vertex of G°, but are not confined to G°. It is not difficult 
to convince oneself that, to be able to approximate ^g(w, ic), it suffices to keep, 
for every Q G Fy, connections G{u, Q) of type-0 (i.e. the connection lengths are 
relative to G°, not the entire G) and connections G{w,Q) of type-1 (i.e. the 
connection lengths are relative to the entire graph G). 

Now suppose that no shortest u-to-w path P (in G) intersects Fy - Then every 
u-to-w path P (in G) intersects Sy and is confined to G°. Then, to approximate 
P it suffices to keep, for every Q G Sy, type-0 connections G(u, Q) and G{w, Q)- 
The distance oracle therefore keeps, for each r G F, for each vertex u G G°'- 

f. connections G{u, Q) of type-0 for all Q G Fy- 

2. connections G(u, Q) of type-f for all Q G Fy- 

3. connections G(u, Q) of type-0 for all Q G Sy- 

These connections, over all u G Gy and all paths in FyUSy are called the (type-0 

or type-f) connections of r. In addition, the data structure stores: 

— A mapping of each vertex u S G to a leaf node Vy gF s.t. u S Gy ^. 


7 


— A least common ancestor data structure over T. 

The space bottleneck is the size of the sets maintained. Each vertex v belongs 
to G° for O(lgn) nodes r of T. For each of the 0(1) paths in the frame and 
separator of each such node r, v has a set of 0(e“^) connections. Hence the total 
space required by Thorup’s oracle is 0(e“^nlgn). 

We next describe how a query is performed. Given a u-to-w distance query, 
let r be the least common ancestor of r„ and in T. The algorithm computes, 
for each path Q of 5,. U Fr the length of a shortest u-to-w path that intersects 
Q using the connections C{u,Q) and C{w,Q) (of both type 0 and type 1). By 
construction of T, the number of such paths Q is constant. It is easy to see that 
computing the distance estimate for each Q can be done in 0{e~^) time. Thus, 
an (1 + e)-approximate distance is produced in 0{e~^) time. 


Efficient construction We now mention some, but not all the details of Tho¬ 
rup’s 0{e~^nlg^ n)-time construction algorithm. Refer to [12, subsection 3.6] for 
the full details. The computation of the connections and connection lengths is 
done top-down the decomposition tree T. Naively using Corollary 1 on G° for all 
r G T is efficient, but only generates type-0 connections on Sr- Using Corollary 1 
on Gr would produce type-0 connections on F^, but is not efficient since jUrl can 
be much larger than |G°|. Instead, For each path Q in Fr, the algorithm uses the 
reduction Q of Q to the vertices of Q that belong to G°. Let G^ be the graph 
composed of G° and Q. Note that |G^| = 0(|G°|). The type-0 connections on 
Fr can now be computed by applying Corollary 1 to G^. 

It remains to compute type-1 connections. Recall that these connection lengths 
reflect distances in the entire graph, not just in Gr- Clearly, applying Corollary 1 
on G for every r is inefficient. Instead, the computation is done top-down F us¬ 
ing an auxiliary construction. This construction augments G° with e-covers of 
the separators of all ancestors of r in T with respect to the vertices of G°. These 
e-covers have already been computed (type-0 connections at the ancestor), and 
represent distances outside Gr- Due to space constraints we defer the details to 
the next section, where we handle the more general case of vertex labels. 

4 Undirected Approximate Vertex-Label Distance Oracle 

The idea is to adapt Thorup’s oracle (Section 3) to the vertex-label case. Tho¬ 
rup’s oracle supports one-to-one (vertex-to-vertex) distance queries, whereas 
here we need one-to-many distance queries. Given two vertices u,v, Thorup’s 
oracle finds the LCA of and Vy in T, and uses its connections to produce 
the answer. In a one-to-many query, there is no analogue for v. We do know, 
however, that a shortest u-to-A path must intersect the separator of the leafmost 
node r in T that contains u and some A-labeled vertex. The node r takes the 
role of the LCA of r„ and Vy- In order to be able to use r’s connections in a 
distance query one must make sure that r’s connections represent approximate 
distances to A-labeled vertices in the entire graph, not just in G°. 


We define a set £ of new (artificial) vertices, one per label. For every r G T, 
let = {A S £|G° n 14 0} be the restriction of C to labels in G°. 

Simply connecting each vertex of V\ to an artificial vertex representing the 
label A is bound to fail. To see why, suppose vertices u and v both have label 
A. Adding an artificial vertex A and zero-length undirected edges vX and uX 
creates a zero-length path between u and v that does not exist in the original 
graph. While this does not change the distance between any vertex and its closest 
A-labeled vertex, it may change distances between a vertex and its closest X'- 
labeled vertex (A' 7 ^ A). Therefore, we would have liked to add, for each label A 
separately, a single artificial vertex A, and compute the connection sets G(A, Q). 
Doing so would result in correct distance estimates, but is not efficient. We 
show how to compute the connections C{X,Q) without actually performing this 
inefficient procedure. Instead of having a single artificial vertex per label, it 
is split into many artificial vertices (one for each incident edge). The problem 
with this approach is that the number of connections becomes too large (each 
split vertex has its own set of 0{e~^) connections). We use an extension of the 
thinning procedure (Lemma 4) to select a small subset of these connections and 
still get the desired approximation. 

Another point that we must address is that, for X G Cr, the type-1 connec¬ 
tions C{X, Q) should reflect the minimum distances between the connections of 
A on Q to the closest vertex with label A in G, not just to vertices with label A in 
G°. We show how to achieve this by an extension of the auxiliary construction 
used to compute they type-1 connections in Thorup’s unlabeled oracle. 

We start with the extended thinning lemma. 

Lemma 6. Let {ui} be vertices and Q be a shortest path. Given ordered e- 
covering sets {D(ui,Q)} it is possible to compute in linear time a clean and 
ordered 3e-covering connections set C of size 0{e~^) which represent approxi¬ 
mated distances from any q G Q to its closest vertex among {iti}. 

Proof. We first convert every connection length £{ui,Q) to reflect an approx¬ 
imated length from q to its closest vertex u* G {uj}, rather than to Ui. We 
obtain these lengths using the fact that q is e-covered with respect to u* by 
some connection in D(u*,Q). Let be the graph composed of the following. 
See Figure 3 in the appendix for an illustration. 

1. Q, the reduced form of Q to connections of all {D{ui,Q)}. 

2. vertices {ui}, along with edges between each m to its connections, with 

lengths equal to the corresponding connection lengths. 

3. vertex u, connected with zero-length edges to all {ui}. 

By the e-covering property, the distances between every q G Q and u in 
Zu represent approximate distances between q and its closest vertex u* G {uj} 
in G. To see this, assume q G Q is a, connection of ui, and is closest to u*. 
Let q* be a connection of u* which e-covers q w.r.t. u*. Then Sz,^{q,u*) < 
Sqiq, q*) + t{q*,u*) = Sciq, q*) + £{q*,u*) < (1 -f e)5G{q, u*). 

It is possible to compute all shortest paths from u in Zu in linear time; first, 
relax all edges incident to u and {ui}. Then, relax the edges of Q by going first 
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in one direction along Q and then relaxing the same edges again in the other 
direction. For connection p on Q, a, u-to-p shortest path first reaches Q along 
one of {ui} edges and then walks along Q toward p. Hence the relaxation was 
done in the correct order. We update the connection lengths to the distances 
thus computed. 

Let D{u, Q) denote the ordered union of all connections, along with the 
updated connection lengths. Since all {D{ui,Q)} were ordered, it is possible to 
order their union in linear time. Let Gu be the graph obtained from G by adding 
an apex u connected with zero length edges to all {ui}. We stress that Gu is not 
constructed by the algorithm, but only used in the proof. D(u, Q) is an e-cover 
of Q with respect to u in Gu- Now apply Lemma 4 to D{v, Q) with eo = ei = e 
to obtain a 3e-cover of Q with respect to u in G„ of size □ 


4.1 Vertex-label distance oracle for undirected graphs 

The vertex labeled distance oracle is very similar to the unlabeled one (Section 3). 
It uses the same decomposition tree T, and stores, for each r S T, the same 
covering sets. The only difference is that, in addition to the covering sets G{u, Q) 
for each vertex u G G°, the oracle also stores connection information for labels 
as we now explain. 

For every r G T and X G Cr, the oracle stores connections G{X,Q) of both 
type-0 and type-1. The type-0 connections G{X, Q) are connections in the graph 
obtained from G° by adding an artificial vertex A, along with zero length edges 
from all A-labeled vertices in G° to A. The type-1 connections C{X, Q) are con¬ 
nections in the graph obtained from G by adding an artificial vertex A, along 
with zero length edges from all A-labeled vertices in G to A. Before explaining 
how to compute these connections we discuss how a distance query is performed. 

Obtaining the distance from u to A is done by finding the lowest ancestor r 
of r„ with X G Cr- A shortest u-to-A path must cross Sr, and perhaps also Fr- 
The algorithm estimates, for each path Q of Sr U Fr, the length of a shortest 
u-to-A path that intersects Q, using the connections G{u, Q) and C{X, Q) stored 
for r (Since X G Cr, r does store Q-to-A connections). 

Finding r can be done by binary search on the path from to the root of T. 
The number of steps of the binay search is O(lglgn). Finding whether a node 
r' has a vertex with label A can be done, e.g., by storing all unique labels in 
G°, in a binary search tree, or by hashing. In the former case finding r takes 
0 (lglgnlg |L|) time, and in the latter O(lglgn), assuming the more restrictive 
word-RAM model of computation. 

It remains to show how the connections are computed. We begin with the 
type-0 connections. For every r G T, for every Q G Fr U Sr, the algorithm 
computes ordered e-covering sets of connections on Q w.r.t. each vertex of G° to 
Q by invoking Corollary 1 to G°. This takes 0(e“^|V(G°)| Ign) time (using [5] 
for shortest path computation). For each X G Cr, let n\ denote the number of 
A-labeled vertices in G°. The total number of connections to A-labeled vertices 
in G° is 0(e“^nA). The algorithm next applies the extended thinning lemma 
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(Lemma 6) to get a connections set C{X,Q) of size 0{e~^) in 0{e~^n\) time. 
Since “ C>(I^(G°)I)) the runtime for a single r and Q is 0{e~^\V{G°)\). 

We now show how to compute the type-1 connections without invoking 
Corollary 1 to the entire input graph G at every call. 

Lemma 7. Let r g7^. Type-1 connections for r can he computed using just the 
(type-0) connections of strict ancestors of r. Computing all type-1 connections 
for all r £'T can be done in 0(e“^nlg^n) time. 

Proof. Let Q be a path in Fr. Let X(? be the graph composed of the following: 
(see Figure 4 in the appendix for an illustration) 

— The vertices Cr 

— The vertices and edges of Q, the reduction of Q to V{Q) fl V{G').). 

— For each strict ancestor r' of r, for each path Q' G Sr-i, the vertices and 
edges of Q', the reduction of Q' to vertices that are (type-0) connections 
(in G),)) of Q' w.r.t. vertices in Q U Cr , along with edges representing the 
corresponding connection lengths. 

The algorithm creates a graph from X^ by breaking every artificial 
vertex A in X^ into many copies {Ae}, one per incident edge of A. We stress 
that the artificial vertices Ae are not directly connected to each other in X^. 
Hence, the problem of shortcuts mentioned earlier is avoided. See Figure 5 in 
the appendix for an illustration. 

Note that splitting vertices in this way does not increase the number of edges 
in the X^. The algorithm applies Corollary 1 to X() and Q, obtaining a small 
sized e-cover G(\e,Q) for every Ae. 

Let q be any vertex of Q, and let A be a label in G°. Let P be a shortest g-to-A 
path in G. Let r' be the rootmost strict ancestor of r such that Sr' is intersected 
by P. Note that r' must exist since q G Fr, so q belongs to the separator of 
some strict ancestor of r. Thus P is entirely contained in G),. Let Q' be a path 
in Sr' intersected by P. By construction of X^, it contains an e-covering set of 
connections of Q' with respect to q in G),, as well as the edges of Q' and an 
e-covering set of connections of Q' with respect to A in G),. Hence, by Lemma 2, 
there exists a shortest g-to-Ae path (for some artificial vertex Ae) in X(? whose 
length is at most (1 -|- e) times the length of P. On the other hand, because the 
vertices Ae (for any X G £r) are not directly connected to each other in X^, 
every path in X^ corresponds to some path in G, so shortest paths in X^ are at 
least as long as those in G. This proves that X(? correctly represents all desired 
type-1 connection lengths. 

We proceed with describing the construction of the connection sets of the 
appropriate sizes. To bound the size of the connections {C{Xe,Q)}, we count 
the number of edges incident to A in X^ (i.e., before it is split). There is an 
edge for each of the 0{e~^) connections of A on each of the O(lgn) paths of 
separators of ancestors of r. For each such edge there is a vertex Ae with an 
e-covering set of Q of size 0(e“^). Thus, the total number of connections of Q 
for all Ae vertices is 0{e~^ The algorithm applies Lemma 6, the extended 
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thinning procedure, to {C{Xe,Q)}^ to get C{X,Q) of size 0{e~^). Doing so for 
all labels in G° requires 0(e“^ Ign + e~^\Cr\) space. 

We now bound the running time. Since splitting vertices does not increase 
the number of edges, applying Corollary 1 to takes 0{€~‘^\V {G°)\ Ig^ n) time. 

Applying Lemma 6 is done within the same time bound. To conclude, the total 
runtime over all nodes of T is 0(e“^n Ig^ n). □ 

We have thus established our main theorem: 

Theorem 1. A (1 + ej-stretch {0{€~^n\gn)space > 0{\g\gn-\-e~^)time) Vertex- 
Label Distance Oracle can be constructed within 0{e~^nlg^ n) time w.h.p. ^ in 
an undirected planar graph with n vertices. 
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Appendix 

A Proof of Lemma 4 

Lemma 4. Let Q be a path in an undirected graph, and let v be a vertex. Let 
D(v,Q) be an ordered CQ-cover of Q w.r.t. v. For any ei < 1, a clean and 
ordered (2eo + €i)-cover C{v, Q) C D{v, Q) of size 0(ej"^) can be constructed in 
0{\D{v,Q)\) time. 

Proof. The proof is constructive. Let {q, v) be a connection with minimal con¬ 
nection length in D{v, Q). The vertex q splits Q into two subpaths, Qo and Qi 
. For each Q G {Qo, Qi}, the algorithm operates as follows. First, it adds {q, v) 
to C{Q ,v). The algorithm will now progress towards the other endpoint of Q . 
We say q semi e-covers q* if 6Q{q*,q) -\-iiq,v) < (1 e)i{q*,v). ® 

Let {q,v) be the last connection added to C{Q ,v). Let {q*,v) be the next 
connection of D{v, Q) that has not been considered yet. The algorithm adds 
{q*,v) unless q already semi ei-covers {q*,v). The algorithm returns C(v,Q) = 
C{Qo,v) U C{Qi,v). 

We first prove that C{v, Q) is a (2eo -l-ei)-cover. Let g be a vertex in Q. Let d 
be the connection in D{v, Q) which eo"COvers q. Let c be a connection of C{v, Q) 
that semi ei-covers d (it might be that c = d). 

We know that 

i^(q,c) < S(q,d) -hSgldyC) (triangle inequality) (2) 

SQ(d, c) £{c, u) < (1 -I- ei)£{d, v) (c semi ei-covers d) (3) 

S(q, d) -j- £{d, w) < (1 -I- eo)6{q, v) {d eo-covers q) (4) 

We have that: 6{q, c) £{c, v) < S{q, d) SQ{d, c) -f £{c, v) < 6{q, d) -f (1 -I- 

(2) (3) 

ei)£{d, u) < (1 -I- ei){6{q, d) £{d, v)) < (1 + ei)(l -I- eo)S(q, v) = (1 -h cq -h ei -h 

(4) 

< (1 + (2eo -f ei))S(q,v), and the approximation bound follows. 

ei<l 

We now turn to show the generated cover is of 0(e)"^) size. For Q G {Qo,Qi}, 
we show it is of size 0{ef^). Let {ci}i>i) of size k, be the chosen connections 
along Q , numbered by their order along Q toward the other endpoint t oi Q , 
starting with ci = q. We examine the function f{ci) = 5Qlt,Ci) £(ci,v). We 
observe that f{ci) - /(c^+i) = {SQ{t,Ci) -\-£ici,v)) - {SQ{t,Ci+i) £{ci+i,v)) = 
£{ci,v) SQ{ci,Ci+i) -£{ci+i,v) > ei£{ci+i,v) > ei£{q,v) > eiS{q,v). Thereby, 
/(ci+i) < /(ci) - ieiS{q,v), hence f{ck) < /(ci) - (k - l)eiS{q,v). Note that 
f{ck) = 5Q{t,Ck) -G£{ck,v) > 6{t,v) > SQ{t,q) - SQ{q,v). Using the lower and 
upper bounds over f(ck), we have that dqlt, q) — S{q, v) < fick) < /(ci) — {k — 
l)eiS{q, v) = Sglf, q)+£{q, v)-{k- l)ei6{q, v) < Sgit, g) + (1 + eo)S{q, u) - (fc - 
l)ei(5(d, v). Hence {{k — l)ei — (1 + £ 0 )^( 9 , v) < Sgi^q, v) and so fc < 1 -f ^ 7 ^. 
Therefore the size of the connections obtained over both {Qo, Qi\ is 0{ef^). □ 

® The semi e-cover definition is similar to e-cover definition. The only difference is 
that S{q*,v) was replaced by £(q*,v) for fast computation purposes. 
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B A flaw in Thorup’s treatment of the undirected case 

There is another notion of covering, apart from e-covering and quasi-e-covering [12]. 


Definition 3. q* strictly e-covers q w.r.t. v if S{q,q*) + £{q*,v) < S{q,v) + 
e6{v,Q). 


In [12], Thorup uses quasi-e-covers and strict-e-covers, but does not use 
(plain) e-covers.® Most of the discussion in [12] is devoted to the directed case, 
which uses yet another notion of covering. When treating the undirected case, 
Thorup claims that all lemmas, except for the efficient construction procedure, 
carry over from the directed case to the undirected case when the directed defini¬ 
tion of e-covering is replaced with strict e-covering. The treatment of the efficient 
construction for the undirected case is more detailed, where a procedure for effi¬ 
ciently constructing quasi-e-covering sets is given (Lemma 3, [12, Lemma 3.18[). 

We believe that the treatment of the undirected case in [12[ suffers from two 
flaws. First, the proof of the thinning procedure does not seem to carry over from 
the directed case to the undirected case when using strict e-covers. Second, since 
the construction is of quasi-e-covers, whereas all other parts of the undirected 
oracle in [12[ assume strict-e-covers, the correctness of the entire oracle is not 
established. 

Our algorithm does not use strict e-covers at all. We use Thorup’s efficient 
construction of quasi e-covers, which, by Proposition 1 is also a 0(e)-cover, and 
prove that the thinning procedure and query algorithm work for e-covers. 

C Vertex-Label distance oracle for directed planar graphs 

Thorup shows that the problem of constructing a distance oracle for a directed 
graph can be reduced to constructing a distance oracle for a restricted kind of 
graph, defined in the following. 

Definition 4. A set T of arcs in a graph H is a {t, a)-layered spanning tree if 
it satisfies the following properties: 

— Disoriented - it can be oriented to form a spanning tree of H. 

— Each branch (a path from the root of T) is a concatenation of at most t 
shortest paths in H. 

— Each shortest path in a branch of T is of length at most a 

Definition 5. A graph H is called {t,a)-layered if it has a (t,a)-layered span¬ 


ning tree. 


Definition 6. A scale-{a,e) distance oracle for a (t,a)-layered graph El is a 
data structure that, when queried for 5 h{v,w) returns 



d € [5h{v,w),6h{v,w)-\- ea] SHiv,w)<cx 


oo 


otherwise 


® Thorup did not use the terms strict and quasi. 
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The reduction is summarized in the following lemma. Let N be the maximum 
length of an arc in G. 

Lemma 8. (fl2, Section 3.1,3.2,3.3]) Given a scale-{a,e') {0{s{n,e'))space > 
0{t{e'))time) algorithm, a (l+e)-stretch {0{s{n,e)\g{nN))space ; 0(t(|) Iglg + 
i(f ))tzme) algorithm can be constructed to any input graph. 

Thorup shows each graph can be decomposed to a (3, a)-layered graphs, of 
total linear size (with a the distance bound of the graph). Therefore its suffices 
to show how to construct a scale-(a, e) distance oracle. 

To do so, Thorup shows a directed variant to Lemma 1: 

Lemma 9. Let Q be a shortest path in a directed graph H. There exist sets 
C{u,Q) of 0(e~^) vertices of Q for all u € H, where: 

1. C{u, Q) are called the connections of m on Q. 

2. The distance between u and a connection q G C{u, Q) is called the connection 
length of u and q. 

3. For every u,w G H, if a shortest u-to-w path in H intersects Q, then 

5h^'-{u,w) < 5h{u,w) + ea. 

Here Hq^ is the graph with vertices u, w, and the vertices of the reduction 
of Q to C{u,Q) U C{w,Q), and with u-to-Q and Q-to-v arcs whose lengths 
are the corresponding connection lengths of C{u,Q) and C{w,Q). 

The e-covering definition in the directed case is not the same as in the undi¬ 
rected. It uses a 0{ea) additive error to the approximation rather eSH{u,w). 
Moreover, the u-to-Q cover and w-from-Q cover are different (because of the 
directedness) but obtained in similar manner. 

As in the undirected case, given a (3, Q;)-layered graph, the algorithm keeps 
a recursive graph decomposition fT. The distance oracle keeps, for each r G H, 
for each vertex u G G°: 

1. connections C{u, Q) of type-0 for all Q G Fr. 

2. connections C{u, Q) {Q to u distances) of type-1 for all Q G F^-. 

3. connections G(u, Q) (separate Q to u and u to Q distances) of type-0 for all 
Q G Sr. 

A u-to-w query is done similar to the undirected case, using their relevant 
connections found in the LCA of r„ and . The construction of the connections 
is similar to the undirected case; type-0 connections are computed in G° where 
the frame is reduced to the vertices of G° (using Lemma 1). type-1 connections 
of node r G H are computed in a graph augmented by type-0 connections of 
ancestors of r (see [12, Section 3.6] and Lemma 7 of the undirected case). 

For the vertex-label case, for any r G T we define G° be G° along with 
vertices Cr and arcs from any A-labeled vertex to A for each vertex X G Cr. 

The vertex-label distance oracle keeps, for each r gT, for each vertex u G G° 
(either in G° or artificial vertex from Cr): 

nN is an upper bound on 5 g(')- 
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1. connections C{u, Q) of type-0 for all Q G Fr- 

2. connections C{u, Q) (Q to u distances) of type-1 for all Q G F^. 

3. connections C{u, Q) (separate Q to u and uto Q distances) of type-0 for all 
Q G Sr- 

A M-to-A query is done similarly to the vertex-to-vertex case relatively to the 
u and A connections of the lowest ancestor of with X G Cr- The construction 
is similar to the vertex-to-vertex construction, but instead done over G°. See 
Figure 6 for the type-1 connections construction directed variant. 

We thus get the following theorem: 

Theorem 2. A (1 + e)-stretch {0{e~^nlgnlg{nN))space , 0(lglgnlglg (nA^) -I- 
Vertex-Label Distance Oracle can he constructed in 0{e~^n Ig^ n \g{nN)) 
time w.h.p. for a directed planar graph with n vertices and maximum arc length 

N. 


The probability in the construction time is only due to the use of perfect hashing. 
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Fig. 1: An illustration of the decomposition tree T. The root r is associated 
with Gf = G. The children of f, r'o and fi, are associated with the interior and 
exterior of the separator Sf of Gf. 



Fig. 2: The solid lines (thin and thick) indicate Fr, the reduced frame of Gr- The 
bold lines (solid and dashed) indicate Sr, the separator of Gr- Vertices u and w 
are vertices of G° separated by Sr- Every u-to-w path must intersect Sr- The 
dashed line shows a possible shortest u-to-w path in G. 
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Fig. 3: Illustration of the situation in the proof of the extended thinning lemma 
(Lemma 6). Each vertex in {ui} has different connections on Q. The distance 
from ui to any connection of U 2 is approximated using the connections of ui. 



Fig. 4: The figure illustrates a part of X'^. In this example Q is a path in the 
separator of the parent of r (in general Q is a path in , so it may belong to the 
separator of an ancestor of r). The vertices of G° are enclosed in which is 
represented by the dashed lines and by Q. The vertices ui and U 2 are A-labeled 
vertices of G°, and are not part of . Paths from Q to A-labeled vertices such 
as Ui and U 2 are represented in by edges between Q and A. These edges 
correspond to the type-0 connections of A on Q in the parent of r. All solid 
edges are part of X^. A shortest path from gi S Q to A G is approximated 
by connections from qi to a separator of an ancestor of r and from there to A. 
Note that G{X,Q') represent distances from Q' to a A-labeled vertex which is 
not necessarily in G°. 
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Fig. 5: Illustration of the utility of splitting an artificial vertex A. On the left {X^ ) 
undesired shortcuts (teleportation) might occur. On the right (X^) teleportation 
does not occur. 



Fig. 6: Illustration of part of the auxiliary graph X^ for the directed case. No 
teleportation can occur because each artificial vertex A has only incoming arcs, 
and no outgoing arcs. 
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